Consensus clustering applied to multi-omics disease subtyping

نویسندگان

چکیده

Abstract Background Facing the diversity of omics data and difficulty selecting one result over all those produced by several methods, consensus strategies have potential to reconcile multiple inputs produce robust results. Results Here, we introduce ClustOmics, a generic clustering tool that use in context cancer subtyping. ClustOmics relies on non-relational graph database, which allows for simultaneous integration both results from various methods. This new conciliates input clusterings, regardless their origin, number, size or shape. implements an intuitive flexible strategy, based upon idea evidence accumulation . computes co-occurrences pairs samples clusters uses this score as similarity measure reorganize into clusters. Conclusion We applied multi-omics disease subtyping real TCGA ten different types. showed is heterogeneous qualities partitions, smoothing reconciling preliminary predictions high-quality clusters, computational biological point view. The comparison state-of-the-art consensus-based tool, COCA, further corroborated statement. However, main interest not compete with other tools, but rather make profit when no gold-standard metric available assess significance. Availability source code, released under MIT license, obtained are GitHub: https://github.com/galadrielbriere/ClustOmics

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clustering Gene Expression Regulators: New Approach to Disease Subtyping

One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms as to develop more personalized approach to therapy. Here we propose novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on Sub-Network Enrichment Analysis algorithm (SNEA) w...

متن کامل

Entropy-based Consensus for Distributed Data Clustering

The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...

متن کامل

Lecture 9 : Multi Omics Clustering December 19 , 2017

Omic is a term used to describe a field of study in biology that utilizes a certain type of biological data (e.g. genomics is the study of genome), while multi omics is the usage of several types of omics data. During the course we’ve mainly discussed mRNA data. In the introduction lecture other types of biological data were shown, protein and DNA data (referred to as proteomics and genomics). ...

متن کامل

Consensus Clustering + Meta Clustering = Multiple Consensus Clustering

Consensus clustering and meta clustering are two important extensions of the classical clustering problem. Given a set of input clusterings of a given dataset, consensus clustering aims to find a single final clustering which is a better fit in some sense than the existing clusterings, and meta clustering aims to group similar input clusterings together so that users only need to examine a smal...

متن کامل

Using “omics” and integrated multi-omics approaches to guide

37 38 Running title: Using multi-omics for probiotic selection 39 40

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: BMC Bioinformatics

سال: 2021

ISSN: ['1471-2105']

DOI: https://doi.org/10.1186/s12859-021-04279-1